Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 338592 |
| Missing cells | 337964 |
| Missing cells (%) | 5.0% |
| Duplicate rows | 291922 |
| Duplicate rows (%) | 86.2% |
| Total size in memory | 51.7 MiB |
| Average record size in memory | 160.0 B |
Variable types
| Categorical | 2 |
|---|---|
| Numeric | 18 |
| Dataset has 291922 (86.2%) duplicate rows | Duplicates |
SanchiName has a high cardinality: 77 distinct values | High cardinality |
RuikeiHonsyoHeiti is highly correlated with RuikeiSyutokuHeichi | High correlation |
RuikeiHonsyoSyogai is highly correlated with RuikeiFukaSyogai and 1 other fields | High correlation |
RuikeiFukaSyogai is highly correlated with RuikeiHonsyoSyogai and 1 other fields | High correlation |
RuikeiSyutokuHeichi is highly correlated with RuikeiHonsyoHeiti | High correlation |
RuikeiSyutokuSyogai is highly correlated with RuikeiHonsyoSyogai and 1 other fields | High correlation |
SogoChakukaisu2 is highly correlated with ChuoChakukaisu2 | High correlation |
SogoChakukaisu3 is highly correlated with ChuoChakukaisu3 | High correlation |
SogoChakukaisu4 is highly correlated with ChuoChakukaisu4 | High correlation |
SogoChakukaisu5 is highly correlated with ChuoChakukaisu5 | High correlation |
ChuoChakukaisu2 is highly correlated with SogoChakukaisu2 | High correlation |
ChuoChakukaisu3 is highly correlated with SogoChakukaisu3 | High correlation |
ChuoChakukaisu4 is highly correlated with SogoChakukaisu4 | High correlation |
ChuoChakukaisu5 is highly correlated with SogoChakukaisu5 | High correlation |
Syotai has 337964 (99.8%) missing values | Missing |
RuikeiHonsyoSyogai is highly skewed (γ1 = 27.14302994) | Skewed |
RuikeiFukaHeichi is highly skewed (γ1 = 23.30715518) | Skewed |
RuikeiFukaSyogai is highly skewed (γ1 = 32.1404276) | Skewed |
RuikeiSyutokuSyogai is highly skewed (γ1 = 38.92981855) | Skewed |
RuikeiHonsyoHeiti has 57106 (16.9%) zeros | Zeros |
RuikeiHonsyoSyogai has 315907 (93.3%) zeros | Zeros |
RuikeiFukaHeichi has 216858 (64.0%) zeros | Zeros |
RuikeiFukaSyogai has 334777 (98.9%) zeros | Zeros |
RuikeiSyutokuHeichi has 110114 (32.5%) zeros | Zeros |
RuikeiSyutokuSyogai has 324460 (95.8%) zeros | Zeros |
SogoChakukaisu1 has 92437 (27.3%) zeros | Zeros |
SogoChakukaisu2 has 111847 (33.0%) zeros | Zeros |
SogoChakukaisu3 has 104875 (31.0%) zeros | Zeros |
SogoChakukaisu4 has 105235 (31.1%) zeros | Zeros |
SogoChakukaisu5 has 104212 (30.8%) zeros | Zeros |
ChuoChakukaisu1 has 124139 (36.7%) zeros | Zeros |
ChuoChakukaisu2 has 143149 (42.3%) zeros | Zeros |
ChuoChakukaisu3 has 133347 (39.4%) zeros | Zeros |
ChuoChakukaisu4 has 130810 (38.6%) zeros | Zeros |
ChuoChakukaisu5 has 127972 (37.8%) zeros | Zeros |
Reproduction
| Analysis started | 2021-04-07 13:10:54.516711 |
|---|---|
| Analysis finished | 2021-04-07 13:12:53.243340 |
| Duration | 1 minute and 58.73 seconds |
| Software version | pandas-profiling v2.11.0 |
| Download configuration | config.yaml |
| Distinct | 23 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 337964 |
| Missing (%) | 99.8% |
| Memory size | 2.6 MiB |
| 笠松 | |
|---|---|
| 愛知 | |
| 佐賀 | |
| 船橋 | |
| 兵庫 | |
| Other values (18) |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.24044586 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1407 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 2 ? |
| Distinct scripts | 3 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 北海道 |
|---|---|
| 2nd row | 北海道 |
| 3rd row | 北海道 |
| 4th row | 北海道 |
| 5th row | 北海道 |
| Value | Count | Frequency (%) |
| 笠松 | 142 | < 0.1% |
| 愛知 | 62 | < 0.1% |
| 佐賀 | 62 | < 0.1% |
| 船橋 | 61 | < 0.1% |
| 兵庫 | 58 | < 0.1% |
| 大井 | 54 | < 0.1% |
| 北海道 | 32 | < 0.1% |
| 香港 | 28 | < 0.1% |
| 川崎 | 27 | < 0.1% |
| 浦和 | 19 | < 0.1% |
| Other values (13) | 83 | < 0.1% |
| (Missing) | 337964 |
| Value | Count | Frequency (%) |
| 笠松 | 142 | |
| 佐賀 | 62 | |
| 愛知 | 62 | |
| 船橋 | 61 | |
| 兵庫 | 58 | |
| 大井 | 54 | 8.6% |
| 北海道 | 32 | 5.1% |
| 香港 | 28 | 4.5% |
| 川崎 | 27 | 4.3% |
| 浦和 | 19 | 3.0% |
| Other values (13) | 83 |
Most occurring characters
| Value | Count | Frequency (%) |
| 笠 | 142 | 10.1% |
| 松 | 142 | 10.1% |
| 知 | 67 | 4.8% |
| 愛 | 62 | 4.4% |
| 佐 | 62 | 4.4% |
| 賀 | 62 | 4.4% |
| 船 | 61 | 4.3% |
| 橋 | 61 | 4.3% |
| 兵 | 58 | 4.1% |
| 庫 | 58 | 4.1% |
| Other values (40) | 632 |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 1403 | |
| Modifier Letter | 4 | 0.3% |
Most frequent character per category
| Value | Count | Frequency (%) |
| 笠 | 142 | 10.1% |
| 松 | 142 | 10.1% |
| 知 | 67 | 4.8% |
| 愛 | 62 | 4.4% |
| 佐 | 62 | 4.4% |
| 賀 | 62 | 4.4% |
| 船 | 61 | 4.3% |
| 橋 | 61 | 4.3% |
| 兵 | 58 | 4.1% |
| 庫 | 58 | 4.1% |
| Other values (39) | 628 |
| Value | Count | Frequency (%) |
| ー | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Han | 1180 | |
| Katakana | 223 | 15.8% |
| Common | 4 | 0.3% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 笠 | 142 | 12.0% |
| 松 | 142 | 12.0% |
| 知 | 67 | 5.7% |
| 愛 | 62 | 5.3% |
| 佐 | 62 | 5.3% |
| 賀 | 62 | 5.3% |
| 船 | 61 | 5.2% |
| 橋 | 61 | 5.2% |
| 兵 | 58 | 4.9% |
| 庫 | 58 | 4.9% |
| Other values (18) | 405 |
| Value | Count | Frequency (%) |
| ス | 30 | |
| イ | 28 | |
| ラ | 23 | |
| ン | 23 | |
| リ | 20 | |
| フ | 15 | |
| ド | 14 | 6.3% |
| ア | 13 | 5.8% |
| ギ | 13 | 5.8% |
| ツ | 8 | 3.6% |
| Other values (11) | 36 |
| Value | Count | Frequency (%) |
| ー | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| CJK | 1180 | |
| Katakana | 227 | 16.1% |
Most frequent character per block
| Value | Count | Frequency (%) |
| 笠 | 142 | 12.0% |
| 松 | 142 | 12.0% |
| 知 | 67 | 5.7% |
| 愛 | 62 | 5.3% |
| 佐 | 62 | 5.3% |
| 賀 | 62 | 5.3% |
| 船 | 61 | 5.2% |
| 橋 | 61 | 5.2% |
| 兵 | 58 | 4.9% |
| 庫 | 58 | 4.9% |
| Other values (18) | 405 |
| Value | Count | Frequency (%) |
| ス | 30 | |
| イ | 28 | |
| ラ | 23 | |
| ン | 23 | |
| リ | 20 | |
| フ | 15 | 6.6% |
| ド | 14 | 6.2% |
| ア | 13 | 5.7% |
| ギ | 13 | 5.7% |
| ツ | 8 | 3.5% |
| Other values (12) | 40 |
BreederCode
Real number (ℝ≥0)
| Distinct | 2711 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 467007.2923 |
|---|---|
| Minimum | 0 |
| Maximum | 993201 |
| Zeros | 82 |
| Zeros (%) | < 0.1% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 33305 |
| Q1 | 301513 |
| median | 400015 |
| Q3 | 710303 |
| 95-th percentile | 910369 |
| Maximum | 993201 |
| Range | 993201 |
| Interquartile range (IQR) | 408790 |
Descriptive statistics
| Standard deviation | 265687.4571 |
|---|---|
| Coefficient of variation (CV) | 0.5689150071 |
| Kurtosis | -0.9659484301 |
| Mean | 467007.2923 |
| Median Absolute Deviation (MAD) | 196710 |
| Skewness | 0.1675955592 |
| Sum | 1.581249331 × 1011 |
| Variance | 7.058982484 × 1010 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 373126 | 28915 | 8.5% |
| 393126 | 25961 | 7.7% |
| 341126 | 9029 | 2.7% |
| 913124 | 5033 | 1.5% |
| 710303 | 4691 | 1.4% |
| 433129 | 4508 | 1.3% |
| 811540 | 4386 | 1.3% |
| 400018 | 3713 | 1.1% |
| 233071 | 3458 | 1.0% |
| 301513 | 3130 | 0.9% |
| Other values (2701) | 245768 |
| Value | Count | Frequency (%) |
| 0 | 82 | < 0.1% |
| 14 | 2156 | |
| 32 | 237 | 0.1% |
| 42 | 6 | < 0.1% |
| 49 | 283 | 0.1% |
| Value | Count | Frequency (%) |
| 993201 | 19 | |
| 990887 | 6 | < 0.1% |
| 990886 | 1 | < 0.1% |
| 990883 | 1 | < 0.1% |
| 990882 | 3 | < 0.1% |
| Distinct | 77 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
| 新ひだか町 | |
|---|---|
| 浦河町 | |
| 新冠町 | |
| 日高町 | |
| 安平町 | |
| Other values (72) |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.358531802 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1137172 |
|---|---|
| Distinct characters | 93 |
| Distinct categories | 1 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 新冠町 |
|---|---|
| 2nd row | 新冠町 |
| 3rd row | 新冠町 |
| 4th row | 新冠町 |
| 5th row | 新冠町 |
| Value | Count | Frequency (%) |
| 新ひだか町 | 67675 | |
| 浦河町 | 61365 | |
| 新冠町 | 51425 | |
| 日高町 | 48728 | |
| 安平町 | 32508 | |
| 千歳市 | 25958 | 7.7% |
| 白老町 | 7993 | 2.4% |
| 米 | 7929 | 2.3% |
| 平取町 | 6241 | 1.8% |
| 様似町 | 3632 | 1.1% |
| Other values (67) | 25138 | 7.4% |
| Value | Count | Frequency (%) |
| 新ひだか町 | 67675 | |
| 浦河町 | 61365 | |
| 新冠町 | 51425 | |
| 日高町 | 48728 | |
| 安平町 | 32508 | |
| 千歳市 | 25958 | 7.7% |
| 白老町 | 7993 | 2.4% |
| 米 | 7929 | 2.3% |
| 平取町 | 6241 | 1.8% |
| 様似町 | 3632 | 1.1% |
| Other values (67) | 25138 | 7.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 町 | 295750 | |
| 新 | 119386 | |
| か | 71385 | 6.3% |
| ひ | 67802 | 6.0% |
| だ | 67802 | 6.0% |
| 浦 | 61919 | 5.4% |
| 河 | 61468 | 5.4% |
| 冠 | 51493 | 4.5% |
| 日 | 48827 | 4.3% |
| 高 | 48827 | 4.3% |
| Other values (83) | 242513 |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 1137172 |
Most frequent character per category
| Value | Count | Frequency (%) |
| 町 | 295750 | |
| 新 | 119386 | |
| か | 71385 | 6.3% |
| ひ | 67802 | 6.0% |
| だ | 67802 | 6.0% |
| 浦 | 61919 | 5.4% |
| 河 | 61468 | 5.4% |
| 冠 | 51493 | 4.5% |
| 日 | 48827 | 4.3% |
| 高 | 48827 | 4.3% |
| Other values (83) | 242513 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Han | 919687 | |
| Hiragana | 217485 | 19.1% |
Most frequent character per script
| Value | Count | Frequency (%) |
| 町 | 295750 | |
| 新 | 119386 | |
| 浦 | 61919 | 6.7% |
| 河 | 61468 | 6.7% |
| 冠 | 51493 | 5.6% |
| 日 | 48827 | 5.3% |
| 高 | 48827 | 5.3% |
| 平 | 38775 | 4.2% |
| 安 | 32518 | 3.5% |
| 市 | 27069 | 2.9% |
| Other values (75) | 133655 |
| Value | Count | Frequency (%) |
| か | 71385 | |
| ひ | 67802 | |
| だ | 67802 | |
| む | 3583 | 1.6% |
| わ | 3583 | 1.6% |
| え | 1110 | 0.5% |
| り | 1110 | 0.5% |
| も | 1110 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| CJK | 919687 | |
| Hiragana | 217485 | 19.1% |
Most frequent character per block
| Value | Count | Frequency (%) |
| 町 | 295750 | |
| 新 | 119386 | |
| 浦 | 61919 | 6.7% |
| 河 | 61468 | 6.7% |
| 冠 | 51493 | 5.6% |
| 日 | 48827 | 5.3% |
| 高 | 48827 | 5.3% |
| 平 | 38775 | 4.2% |
| 安 | 32518 | 3.5% |
| 市 | 27069 | 2.9% |
| Other values (75) | 133655 |
| Value | Count | Frequency (%) |
| か | 71385 | |
| ひ | 67802 | |
| だ | 67802 | |
| む | 3583 | 1.6% |
| わ | 3583 | 1.6% |
| え | 1110 | 0.5% |
| り | 1110 | 0.5% |
| も | 1110 | 0.5% |
| Distinct | 7081 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 334569.6802 |
|---|---|
| Minimum | 0 |
| Maximum | 18132000 |
| Zeros | 57106 |
| Zeros (%) | 16.9% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 18000 |
| median | 136000 |
| Q3 | 429400 |
| 95-th percentile | 1235500 |
| Maximum | 18132000 |
| Range | 18132000 |
| Interquartile range (IQR) | 411400 |
Descriptive statistics
| Standard deviation | 603552.295 |
|---|---|
| Coefficient of variation (CV) | 1.803965902 |
| Kurtosis | 106.9730932 |
| Mean | 334569.6802 |
| Median Absolute Deviation (MAD) | 136000 |
| Skewness | 7.114155976 |
| Sum | 1.132826172 × 1011 |
| Variance | 3.642753728 × 1011 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 57106 | 16.9% |
| 5000 | 4875 | 1.4% |
| 7500 | 3056 | 0.9% |
| 70000 | 2595 | 0.8% |
| 18000 | 2347 | 0.7% |
| 13000 | 2205 | 0.7% |
| 7000 | 2176 | 0.6% |
| 11000 | 2138 | 0.6% |
| 50000 | 2113 | 0.6% |
| 20000 | 1591 | 0.5% |
| Other values (7071) | 258390 |
| Value | Count | Frequency (%) |
| 0 | 57106 | |
| 2400 | 13 | < 0.1% |
| 2500 | 21 | < 0.1% |
| 2550 | 7 | < 0.1% |
| 3500 | 13 | < 0.1% |
| Value | Count | Frequency (%) |
| 18132000 | 17 | |
| 14458000 | 14 | |
| 13082000 | 7 | |
| 13058000 | 9 | |
| 12519000 | 17 |
| Distinct | 727 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14852.01289 |
|---|---|
| Minimum | 0 |
| Maximum | 7506000 |
| Zeros | 315907 |
| Zeros (%) | 93.3% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 43000 |
| Maximum | 7506000 |
| Range | 7506000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 124414.6578 |
|---|---|
| Coefficient of variation (CV) | 8.376955954 |
| Kurtosis | 1230.613327 |
| Mean | 14852.01289 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 27.14302994 |
| Sum | 5028772750 |
| Variance | 1.547900709 × 1010 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 315907 | |
| 7000 | 556 | 0.2% |
| 70000 | 551 | 0.2% |
| 11000 | 537 | 0.2% |
| 7800 | 518 | 0.2% |
| 18000 | 432 | 0.1% |
| 78000 | 348 | 0.1% |
| 12000 | 314 | 0.1% |
| 20000 | 242 | 0.1% |
| 98000 | 236 | 0.1% |
| Other values (717) | 18951 | 5.6% |
| Value | Count | Frequency (%) |
| 0 | 315907 | |
| 7000 | 556 | 0.2% |
| 7300 | 95 | < 0.1% |
| 7500 | 118 | < 0.1% |
| 7800 | 518 | 0.2% |
| Value | Count | Frequency (%) |
| 7506000 | 26 | |
| 4559000 | 9 | < 0.1% |
| 4538000 | 5 | < 0.1% |
| 4167000 | 8 | < 0.1% |
| 4023000 | 4 | < 0.1% |
| Distinct | 2327 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4012.004684 |
|---|---|
| Minimum | 0 |
| Maximum | 1218840 |
| Zeros | 216858 |
| Zeros (%) | 64.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2450 |
| 95-th percentile | 15670 |
| Maximum | 1218840 |
| Range | 1218840 |
| Interquartile range (IQR) | 2450 |
Descriptive statistics
| Standard deviation | 22605.87382 |
|---|---|
| Coefficient of variation (CV) | 5.634558182 |
| Kurtosis | 784.0498938 |
| Mean | 4012.004684 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 23.30715518 |
| Sum | 1358432690 |
| Variance | 511025531.2 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 216858 | |
| 500 | 603 | 0.2% |
| 1100 | 582 | 0.2% |
| 1000 | 577 | 0.2% |
| 530 | 540 | 0.2% |
| 550 | 515 | 0.2% |
| 580 | 513 | 0.2% |
| 1080 | 506 | 0.1% |
| 520 | 497 | 0.1% |
| 1040 | 472 | 0.1% |
| Other values (2317) | 116929 |
| Value | Count | Frequency (%) |
| 0 | 216858 | |
| 130 | 24 | < 0.1% |
| 160 | 19 | < 0.1% |
| 170 | 3 | < 0.1% |
| 180 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 1218840 | 15 | |
| 895670 | 7 | |
| 821100 | 7 | |
| 806330 | 9 | |
| 743100 | 17 |
| Distinct | 186 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.68510774 |
|---|---|
| Minimum | 0 |
| Maximum | 53370 |
| Zeros | 334777 |
| Zeros (%) | 98.9% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 53370 |
| Range | 53370 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 929.9900656 |
|---|---|
| Coefficient of variation (CV) | 17.00627655 |
| Kurtosis | 1415.743534 |
| Mean | 54.68510774 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 32.1404276 |
| Sum | 18515940 |
| Variance | 864881.522 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 334777 | |
| 420 | 159 | < 0.1% |
| 840 | 140 | < 0.1% |
| 430 | 116 | < 0.1% |
| 860 | 108 | < 0.1% |
| 710 | 84 | < 0.1% |
| 3080 | 66 | < 0.1% |
| 340 | 63 | < 0.1% |
| 1460 | 58 | < 0.1% |
| 4970 | 52 | < 0.1% |
| Other values (176) | 2969 | 0.9% |
| Value | Count | Frequency (%) |
| 0 | 334777 | |
| 250 | 18 | < 0.1% |
| 290 | 5 | < 0.1% |
| 300 | 22 | < 0.1% |
| 310 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 53370 | 26 | |
| 53000 | 9 | < 0.1% |
| 33960 | 9 | < 0.1% |
| 33570 | 5 | < 0.1% |
| 31790 | 18 |
| Distinct | 898 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 89162.66185 |
|---|---|
| Minimum | 0 |
| Maximum | 9068000 |
| Zeros | 110114 |
| Zeros (%) | 32.5% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 40000 |
| Q3 | 95000 |
| 95-th percentile | 330000 |
| Maximum | 9068000 |
| Range | 9068000 |
| Interquartile range (IQR) | 95000 |
Descriptive statistics
| Standard deviation | 223941.2681 |
|---|---|
| Coefficient of variation (CV) | 2.511603663 |
| Kurtosis | 275.4020308 |
| Mean | 89162.66185 |
| Median Absolute Deviation (MAD) | 40000 |
| Skewness | 12.21341436 |
| Sum | 3.0189764 × 1010 |
| Variance | 5.014969154 × 1010 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 110114 | |
| 20000 | 44496 | |
| 95000 | 21472 | 6.3% |
| 40000 | 18564 | 5.5% |
| 70000 | 16807 | 5.0% |
| 45000 | 14660 | 4.3% |
| 90000 | 12583 | 3.7% |
| 155000 | 12443 | 3.7% |
| 135000 | 6801 | 2.0% |
| 150000 | 5942 | 1.8% |
| Other values (888) | 74710 |
| Value | Count | Frequency (%) |
| 0 | 110114 | |
| 500 | 45 | < 0.1% |
| 1000 | 1054 | 0.3% |
| 1500 | 840 | 0.2% |
| 2000 | 1624 | 0.5% |
| Value | Count | Frequency (%) |
| 9068000 | 14 | |
| 7062500 | 17 | |
| 6113000 | 17 | |
| 5408500 | 15 | |
| 5076000 | 9 |
| Distinct | 85 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4493.520225 |
|---|---|
| Minimum | 0 |
| Maximum | 3640000 |
| Zeros | 324460 |
| Zeros (%) | 95.8% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 3640000 |
| Range | 3640000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 50848.83189 |
|---|---|
| Coefficient of variation (CV) | 11.31603494 |
| Kurtosis | 2286.829971 |
| Mean | 4493.520225 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 38.92981855 |
| Sum | 1521470000 |
| Variance | 2585603704 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 324460 | |
| 40000 | 9707 | 2.9% |
| 100000 | 1433 | 0.4% |
| 160000 | 600 | 0.2% |
| 115000 | 314 | 0.1% |
| 110000 | 149 | < 0.1% |
| 95000 | 114 | < 0.1% |
| 175000 | 95 | < 0.1% |
| 240000 | 92 | < 0.1% |
| 305000 | 71 | < 0.1% |
| Other values (75) | 1557 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 324460 | |
| 40000 | 9707 | 2.9% |
| 95000 | 114 | < 0.1% |
| 100000 | 1433 | 0.4% |
| 105000 | 13 | < 0.1% |
| Value | Count | Frequency (%) |
| 3640000 | 26 | |
| 2115000 | 5 | < 0.1% |
| 2070000 | 9 | < 0.1% |
| 1825000 | 4 | < 0.1% |
| 1810000 | 8 | < 0.1% |
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.026512735 |
|---|---|
| Minimum | 0 |
| Maximum | 26 |
| Zeros | 92437 |
| Zeros (%) | 27.3% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.970036867 |
|---|---|
| Coefficient of variation (CV) | 0.9721315012 |
| Kurtosis | 2.795971226 |
| Mean | 2.026512735 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.278915461 |
| Sum | 686161 |
| Variance | 3.881045259 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 92437 | |
| 1 | 69887 | |
| 2 | 57367 | |
| 3 | 49525 | |
| 4 | 32839 | 9.7% |
| 5 | 18273 | 5.4% |
| 6 | 8570 | 2.5% |
| 7 | 4728 | 1.4% |
| 8 | 2424 | 0.7% |
| 9 | 1033 | 0.3% |
| Other values (14) | 1509 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 92437 | |
| 1 | 69887 | |
| 2 | 57367 | |
| 3 | 49525 | |
| 4 | 32839 | 9.7% |
| Value | Count | Frequency (%) |
| 26 | 1 | < 0.1% |
| 22 | 3 | |
| 21 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 19 | 6 |
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.918928976 |
|---|---|
| Minimum | 0 |
| Maximum | 18 |
| Zeros | 111847 |
| Zeros (%) | 33.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 18 |
| Range | 18 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.130110294 |
|---|---|
| Coefficient of variation (CV) | 1.110051659 |
| Kurtosis | 2.443768864 |
| Mean | 1.918928976 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.443317206 |
| Sum | 649734 |
| Variance | 4.537369866 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 111847 | |
| 1 | 70409 | |
| 2 | 52178 | |
| 3 | 37945 | 11.2% |
| 4 | 24893 | 7.4% |
| 5 | 16224 | 4.8% |
| 6 | 11037 | 3.3% |
| 7 | 6721 | 2.0% |
| 8 | 3531 | 1.0% |
| 9 | 1636 | 0.5% |
| Other values (8) | 2171 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 111847 | |
| 1 | 70409 | |
| 2 | 52178 | |
| 3 | 37945 | 11.2% |
| 4 | 24893 | 7.4% |
| Value | Count | Frequency (%) |
| 18 | 8 | < 0.1% |
| 17 | 2 | < 0.1% |
| 15 | 47 | < 0.1% |
| 14 | 61 | < 0.1% |
| 13 | 183 |
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.844393843 |
|---|---|
| Minimum | 0 |
| Maximum | 17 |
| Zeros | 104875 |
| Zeros (%) | 31.0% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.963390025 |
|---|---|
| Coefficient of variation (CV) | 1.064517773 |
| Kurtosis | 2.370173615 |
| Mean | 1.844393843 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.397463253 |
| Sum | 624497 |
| Variance | 3.854900391 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 104875 | |
| 1 | 76982 | |
| 2 | 57608 | |
| 3 | 39159 | 11.6% |
| 4 | 25580 | 7.6% |
| 5 | 14874 | 4.4% |
| 6 | 8834 | 2.6% |
| 7 | 5310 | 1.6% |
| 8 | 2763 | 0.8% |
| 9 | 1459 | 0.4% |
| Other values (7) | 1148 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 104875 | |
| 1 | 76982 | |
| 2 | 57608 | |
| 3 | 39159 | 11.6% |
| 4 | 25580 | 7.6% |
| Value | Count | Frequency (%) |
| 17 | 13 | < 0.1% |
| 15 | 18 | < 0.1% |
| 14 | 18 | < 0.1% |
| 13 | 78 | |
| 12 | 180 |
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.777679331 |
|---|---|
| Minimum | 0 |
| Maximum | 20 |
| Zeros | 105235 |
| Zeros (%) | 31.1% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.902520452 |
|---|---|
| Coefficient of variation (CV) | 1.070227019 |
| Kurtosis | 3.341594241 |
| Mean | 1.777679331 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.496419203 |
| Sum | 601908 |
| Variance | 3.619584068 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 105235 | |
| 1 | 80862 | |
| 2 | 58672 | |
| 3 | 38340 | 11.3% |
| 4 | 23939 | 7.1% |
| 5 | 14131 | 4.2% |
| 6 | 8726 | 2.6% |
| 7 | 4267 | 1.3% |
| 8 | 2213 | 0.7% |
| 9 | 1156 | 0.3% |
| Other values (9) | 1051 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 105235 | |
| 1 | 80862 | |
| 2 | 58672 | |
| 3 | 38340 | 11.3% |
| 4 | 23939 | 7.1% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 19 | 49 | |
| 17 | 3 | < 0.1% |
| 16 | 2 | < 0.1% |
| 14 | 54 |
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.708791111 |
|---|---|
| Minimum | 0 |
| Maximum | 16 |
| Zeros | 104212 |
| Zeros (%) | 30.8% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 16 |
| Range | 16 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.815439766 |
|---|---|
| Coefficient of variation (CV) | 1.062411756 |
| Kurtosis | 3.096025761 |
| Mean | 1.708791111 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.49465968 |
| Sum | 578583 |
| Variance | 3.295821544 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 104212 | |
| 1 | 86087 | |
| 2 | 58785 | |
| 3 | 39537 | 11.7% |
| 4 | 22667 | 6.7% |
| 5 | 13180 | 3.9% |
| 6 | 6957 | 2.1% |
| 7 | 3472 | 1.0% |
| 8 | 1685 | 0.5% |
| 9 | 937 | 0.3% |
| Other values (6) | 1073 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 104212 | |
| 1 | 86087 | |
| 2 | 58785 | |
| 3 | 39537 | 11.7% |
| 4 | 22667 | 6.7% |
| Value | Count | Frequency (%) |
| 16 | 12 | < 0.1% |
| 15 | 2 | < 0.1% |
| 13 | 158 | |
| 12 | 99 | < 0.1% |
| 11 | 304 |
SogoChakukaisu6
Real number (ℝ≥0)
| Distinct | 65 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.81584326 |
|---|---|
| Minimum | 0 |
| Maximum | 89 |
| Zeros | 2637 |
| Zeros (%) | 0.8% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 6 |
| median | 10 |
| Q3 | 16 |
| 95-th percentile | 28 |
| Maximum | 89 |
| Range | 89 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 8.223059731 |
|---|---|
| Coefficient of variation (CV) | 0.6959350723 |
| Kurtosis | 2.980408519 |
| Mean | 11.81584326 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 1.340479696 |
| Sum | 4000750 |
| Variance | 67.61871134 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 21785 | 6.4% |
| 6 | 21232 | 6.3% |
| 8 | 19737 | 5.8% |
| 4 | 19551 | 5.8% |
| 7 | 19490 | 5.8% |
| 9 | 18224 | 5.4% |
| 3 | 17472 | 5.2% |
| 10 | 17146 | 5.1% |
| 11 | 15974 | 4.7% |
| 13 | 14444 | 4.3% |
| Other values (55) | 153537 |
| Value | Count | Frequency (%) |
| 0 | 2637 | 0.8% |
| 1 | 7276 | 2.1% |
| 2 | 12666 | |
| 3 | 17472 | |
| 4 | 19551 |
| Value | Count | Frequency (%) |
| 89 | 19 | < 0.1% |
| 71 | 50 | |
| 69 | 2 | < 0.1% |
| 67 | 28 | |
| 61 | 42 |
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.672210212 |
|---|---|
| Minimum | 0 |
| Maximum | 18 |
| Zeros | 124139 |
| Zeros (%) | 36.7% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 18 |
| Range | 18 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.835322284 |
|---|---|
| Coefficient of variation (CV) | 1.097542804 |
| Kurtosis | 1.68257104 |
| Mean | 1.672210212 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.222755774 |
| Sum | 566197 |
| Variance | 3.368407887 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 124139 | |
| 1 | 68198 | |
| 2 | 48772 | 14.4% |
| 3 | 41321 | 12.2% |
| 4 | 28017 | 8.3% |
| 5 | 15402 | 4.5% |
| 6 | 6495 | 1.9% |
| 7 | 3603 | 1.1% |
| 8 | 1450 | 0.4% |
| 9 | 725 | 0.2% |
| Other values (5) | 470 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 124139 | |
| 1 | 68198 | |
| 2 | 48772 | 14.4% |
| 3 | 41321 | 12.2% |
| 4 | 28017 | 8.3% |
| Value | Count | Frequency (%) |
| 18 | 26 | < 0.1% |
| 13 | 7 | < 0.1% |
| 12 | 34 | < 0.1% |
| 11 | 101 | < 0.1% |
| 10 | 302 |
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.636406058 |
|---|---|
| Minimum | 0 |
| Maximum | 18 |
| Zeros | 143149 |
| Zeros (%) | 42.3% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 18 |
| Range | 18 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.055952412 |
|---|---|
| Coefficient of variation (CV) | 1.256382792 |
| Kurtosis | 2.897165962 |
| Mean | 1.636406058 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.589431578 |
| Sum | 554074 |
| Variance | 4.22694032 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 143149 | |
| 1 | 61084 | |
| 2 | 45486 | 13.4% |
| 3 | 32884 | 9.7% |
| 4 | 20840 | 6.2% |
| 5 | 14596 | 4.3% |
| 6 | 9273 | 2.7% |
| 7 | 5471 | 1.6% |
| 8 | 2791 | 0.8% |
| 9 | 1421 | 0.4% |
| Other values (7) | 1597 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 143149 | |
| 1 | 61084 | |
| 2 | 45486 | 13.4% |
| 3 | 32884 | 9.7% |
| 4 | 20840 | 6.2% |
| Value | Count | Frequency (%) |
| 18 | 8 | < 0.1% |
| 15 | 38 | < 0.1% |
| 14 | 45 | < 0.1% |
| 13 | 114 | < 0.1% |
| 12 | 361 |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.581546522 |
|---|---|
| Minimum | 0 |
| Maximum | 13 |
| Zeros | 133347 |
| Zeros (%) | 39.4% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.889299417 |
|---|---|
| Coefficient of variation (CV) | 1.194589846 |
| Kurtosis | 2.490329105 |
| Mean | 1.581546522 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.503533842 |
| Sum | 535499 |
| Variance | 3.569452286 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 133347 | |
| 1 | 70466 | |
| 2 | 50779 | 15.0% |
| 3 | 33472 | 9.9% |
| 4 | 21710 | 6.4% |
| 5 | 12750 | 3.8% |
| 6 | 7670 | 2.3% |
| 7 | 4288 | 1.3% |
| 8 | 1978 | 0.6% |
| 9 | 1407 | 0.4% |
| Other values (4) | 725 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 133347 | |
| 1 | 70466 | |
| 2 | 50779 | 15.0% |
| 3 | 33472 | 9.9% |
| 4 | 21710 | 6.4% |
| Value | Count | Frequency (%) |
| 13 | 53 | < 0.1% |
| 12 | 168 | < 0.1% |
| 11 | 201 | 0.1% |
| 10 | 303 | 0.1% |
| 9 | 1407 |
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.53417978 |
|---|---|
| Minimum | 0 |
| Maximum | 19 |
| Zeros | 130810 |
| Zeros (%) | 38.6% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.820622714 |
|---|---|
| Coefficient of variation (CV) | 1.186707541 |
| Kurtosis | 3.705495842 |
| Mean | 1.53417978 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.608738684 |
| Sum | 519461 |
| Variance | 3.314667069 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 130810 | |
| 1 | 75520 | |
| 2 | 52535 | |
| 3 | 33671 | 9.9% |
| 4 | 19983 | 5.9% |
| 5 | 11814 | 3.5% |
| 6 | 7594 | 2.2% |
| 7 | 3289 | 1.0% |
| 8 | 1822 | 0.5% |
| 9 | 835 | 0.2% |
| Other values (5) | 719 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 130810 | |
| 1 | 75520 | |
| 2 | 52535 | |
| 3 | 33671 | 9.9% |
| 4 | 19983 | 5.9% |
| Value | Count | Frequency (%) |
| 19 | 49 | < 0.1% |
| 13 | 57 | < 0.1% |
| 12 | 90 | < 0.1% |
| 11 | 196 | |
| 10 | 327 |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.479526982 |
|---|---|
| Minimum | 0 |
| Maximum | 13 |
| Zeros | 127972 |
| Zeros (%) | 37.8% |
| Memory size | 2.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.73247967 |
|---|---|
| Coefficient of variation (CV) | 1.170968621 |
| Kurtosis | 3.27454378 |
| Mean | 1.479526982 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.592288163 |
| Sum | 500956 |
| Variance | 3.001485807 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 127972 | |
| 1 | 82401 | |
| 2 | 52329 | |
| 3 | 34377 | 10.2% |
| 4 | 19243 | 5.7% |
| 5 | 11222 | 3.3% |
| 6 | 5482 | 1.6% |
| 7 | 2698 | 0.8% |
| 8 | 1291 | 0.4% |
| 9 | 794 | 0.2% |
| Other values (4) | 783 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 127972 | |
| 1 | 82401 | |
| 2 | 52329 | |
| 3 | 34377 | 10.2% |
| 4 | 19243 | 5.7% |
| Value | Count | Frequency (%) |
| 13 | 35 | < 0.1% |
| 12 | 131 | < 0.1% |
| 11 | 235 | 0.1% |
| 10 | 382 | |
| 9 | 794 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Syotai | BreederCode | SanchiName | RuikeiHonsyoHeiti | RuikeiHonsyoSyogai | RuikeiFukaHeichi | RuikeiFukaSyogai | RuikeiSyutokuHeichi | RuikeiSyutokuSyogai | SogoChakukaisu1 | SogoChakukaisu2 | SogoChakukaisu3 | SogoChakukaisu4 | SogoChakukaisu5 | SogoChakukaisu6 | ChuoChakukaisu1 | ChuoChakukaisu2 | ChuoChakukaisu3 | ChuoChakukaisu4 | ChuoChakukaisu5 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | NaN | 530331 | 新冠町 | 350100 | 0 | 0 | 0 | 75000 | 0 | 3 | 3 | 0 | 1 | 3 | 8 | 3 | 3 | 0 | 1 | 3 |
| 1 | NaN | 530331 | 新冠町 | 350100 | 0 | 0 | 0 | 75000 | 0 | 3 | 3 | 0 | 1 | 3 | 8 | 3 | 3 | 0 | 1 | 3 |
| 2 | NaN | 530331 | 新冠町 | 350100 | 0 | 0 | 0 | 75000 | 0 | 3 | 3 | 0 | 1 | 3 | 8 | 3 | 3 | 0 | 1 | 3 |
| 3 | NaN | 530331 | 新冠町 | 350100 | 0 | 0 | 0 | 75000 | 0 | 3 | 3 | 0 | 1 | 3 | 8 | 3 | 3 | 0 | 1 | 3 |
| 4 | NaN | 400317 | 新冠町 | 500700 | 0 | 1720 | 0 | 95000 | 0 | 3 | 3 | 4 | 3 | 2 | 9 | 3 | 3 | 4 | 3 | 2 |
| 5 | NaN | 400317 | 新冠町 | 500700 | 0 | 1720 | 0 | 95000 | 0 | 3 | 3 | 4 | 3 | 2 | 9 | 3 | 3 | 4 | 3 | 2 |
| 6 | NaN | 400317 | 新冠町 | 500700 | 0 | 1720 | 0 | 95000 | 0 | 3 | 3 | 4 | 3 | 2 | 9 | 3 | 3 | 4 | 3 | 2 |
| 7 | NaN | 400317 | 新冠町 | 500700 | 0 | 1720 | 0 | 95000 | 0 | 3 | 3 | 4 | 3 | 2 | 9 | 3 | 3 | 4 | 3 | 2 |
| 8 | NaN | 400317 | 新冠町 | 500700 | 0 | 1720 | 0 | 95000 | 0 | 3 | 3 | 4 | 3 | 2 | 9 | 3 | 3 | 4 | 3 | 2 |
| 9 | NaN | 400317 | 新冠町 | 500700 | 0 | 1720 | 0 | 95000 | 0 | 3 | 3 | 4 | 3 | 2 | 9 | 3 | 3 | 4 | 3 | 2 |
Last rows
| Syotai | BreederCode | SanchiName | RuikeiHonsyoHeiti | RuikeiHonsyoSyogai | RuikeiFukaHeichi | RuikeiFukaSyogai | RuikeiSyutokuHeichi | RuikeiSyutokuSyogai | SogoChakukaisu1 | SogoChakukaisu2 | SogoChakukaisu3 | SogoChakukaisu4 | SogoChakukaisu5 | SogoChakukaisu6 | ChuoChakukaisu1 | ChuoChakukaisu2 | ChuoChakukaisu3 | ChuoChakukaisu4 | ChuoChakukaisu5 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 338582 | 佐賀 | 950570 | 熊本 | 35000 | 0 | 570 | 0 | 19000 | 0 | 3 | 0 | 2 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 |
| 338583 | 佐賀 | 513174 | 鹿児島 | 0 | 0 | 0 | 0 | 5000 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 338584 | NaN | 700014 | 浦河町 | 24000 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 3 | 0 | 1 | 0 | 0 | 0 |
| 338585 | NaN | 393126 | 千歳市 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 338586 | NaN | 100046 | 浦河町 | 92000 | 0 | 0 | 0 | 40000 | 0 | 1 | 0 | 0 | 2 | 0 | 3 | 1 | 0 | 0 | 2 | 0 |
| 338587 | 佐賀 | 300337 | 新冠 | 0 | 0 | 0 | 0 | 15000 | 0 | 1 | 0 | 1 | 0 | 2 | 14 | 0 | 0 | 0 | 0 | 0 |
| 338588 | NaN | 540705 | 愛 | 913000 | 0 | 7040 | 0 | 320000 | 0 | 4 | 2 | 3 | 3 | 0 | 12 | 4 | 2 | 3 | 3 | 0 |
| 338589 | NaN | 500319 | 新冠町 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 |
| 338590 | NaN | 373126 | 安平町 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 5 | 0 | 0 | 0 | 0 | 0 |
| 338591 | NaN | 130040 | 浦河町 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
Most frequent
| Syotai | BreederCode | SanchiName | RuikeiHonsyoHeiti | RuikeiHonsyoSyogai | RuikeiFukaHeichi | RuikeiFukaSyogai | RuikeiSyutokuHeichi | RuikeiSyutokuSyogai | SogoChakukaisu1 | SogoChakukaisu2 | SogoChakukaisu3 | SogoChakukaisu4 | SogoChakukaisu5 | SogoChakukaisu6 | ChuoChakukaisu1 | ChuoChakukaisu2 | ChuoChakukaisu3 | ChuoChakukaisu4 | ChuoChakukaisu5 | count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 46 | 川崎 | 533423 | 日高 | 110000 | 0 | 590 | 0 | 31000 | 0 | 3 | 3 | 3 | 2 | 1 | 11 | 0 | 0 | 2 | 1 | 1 | 6 |
| 72 | 笠松 | 100004 | 様似 | 0 | 0 | 0 | 0 | 8500 | 0 | 2 | 0 | 2 | 1 | 2 | 15 | 0 | 0 | 0 | 0 | 0 | 5 |
| 73 | 笠松 | 130055 | 浦河 | 0 | 0 | 0 | 0 | 9000 | 0 | 2 | 0 | 4 | 2 | 2 | 18 | 0 | 0 | 0 | 0 | 0 | 5 |
| 91 | 笠松 | 633087 | 新ひだか | 0 | 0 | 0 | 0 | 11000 | 0 | 4 | 0 | 0 | 1 | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 5 |
| 68 | 笠松 | 3512 | 日高 | 0 | 0 | 0 | 0 | 5500 | 0 | 2 | 6 | 3 | 1 | 1 | 16 | 0 | 0 | 0 | 0 | 0 | 4 |
| 71 | 笠松 | 33468 | 日高 | 0 | 0 | 0 | 0 | 10500 | 0 | 5 | 2 | 2 | 0 | 5 | 20 | 0 | 0 | 0 | 0 | 0 | 4 |
| 101 | 笠松 | 803029 | 新ひだか | 0 | 0 | 0 | 0 | 14000 | 0 | 4 | 1 | 5 | 3 | 1 | 13 | 0 | 0 | 0 | 0 | 0 | 4 |
| 104 | 笠松 | 900011 | 浦河 | 0 | 0 | 0 | 0 | 8000 | 0 | 2 | 2 | 4 | 1 | 4 | 20 | 0 | 0 | 0 | 0 | 0 | 4 |
| 122 | 香港 | 0 | 愛 | 420000 | 0 | 3120 | 0 | 0 | 0 | 10 | 7 | 5 | 0 | 3 | 7 | 0 | 1 | 0 | 0 | 2 | 4 |
| 124 | 香港 | 0 | 新 | 230000 | 0 | 3440 | 0 | 0 | 0 | 6 | 5 | 4 | 7 | 1 | 17 | 0 | 1 | 0 | 0 | 0 | 4 |